Canyu Chen

CLOVER: Closed-Loop Value Estimation & Ranking for End-to-End Autonomous Driving Planning

May 14, 2026
Via arXiv

SilLang: Improving Gait Recognition with Silhouette Language Encoding

Mar 25, 2026
Via arXiv

From Representational Complementarity to Dual Systems: Synergizing VLM and Vision-Only Backbones for End-to-End Driving

Feb 11, 2026
Via arXiv

Artificial Entanglement in the Fine-Tuning of Large Language Models

Jan 11, 2026
Via arXiv

Chameleon: On the Scene Diversity and Domain Variety of AI-Generated Videos Detection

Mar 09, 2025
Via arXiv

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Feb 18, 2025
Via arXiv

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Nov 25, 2024
Via arXiv

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Nov 10, 2024
Via arXiv

Can Knowledge Editing Really Correct Hallucinations?

Oct 21, 2024
Via arXiv

FMBench: Benchmarking Fairness in Multimodal Large Language Models on Medical Tasks

Oct 01, 2024
Via arXiv